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Abstract 

The purpose of this study is to show some mathematical aspects of the adjoint 
method that is a numerical method for the Cauchy problem, an inverse boundary 
value problem. The adjoint method is an iterative method based on the variational 
formulation, and the steepest descent method minimizes an objective functional de- 
rived from our original problem. The conventional adjoint method is time-consuming 
in numerical computations because of the Armijo criterion, which is used to numer- 
ically determine the step size of the steepest descent method. It is important to find 
explicit conditions for the convergence and the optimal step size. Some theoreti- 
cal results about the convergence for the numerical method are obtained. Through 
numerical experiments, it is concluded that our theories are effective. 

Key words: adjoint method, boundary value identification, Cauchy problem, 
convergence proof, optimal step size, steepest descent method, variational 
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1 Introduction 

The Cauchy problem is known as an inverse problem. This problem is to 
identify unknown boundary value on a part of the boundary of a bounded 
domain for the given boundary data on the rest of the boundary. In this 
sense, the problem is regarded as an inverse boundary value problem. 

A numerical method for solution of the Cauchy problem, proposed by On- 
ishi et al. [3], is based on the variational formulation. Namely, this method 
is constructed by formulating the original problem to a minimization prob- 
lem of a functional. The steepest descent method numerically minimizes the 
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functional. Finally, the numerical method reduces to an iterative process in 
which two direct problems are alternately solved. The numerical method is 
also called the adjoint method, and has been applied to some inverse prob- 
lems. The effectiveness of the method has been shown by numerical examples 
[1], [4]. 

However, mathematical properties of the method have not been clear, yet. Ac- 
tually, although the step size of the steepest descent method is a very impor- 
tant factor to influence convergence speed, it is difficult to obtain theoretically 
the suitable step size in general. Hence, in the conventional adjoint method, 
the step size is numerically determined by the Armijo criterion [5]. But, the 
Armijo criterion requires many evaluations because we have to solve the direct 
problem many times. It is a big disadvantage that the conventional method 
is time-consuming. Moreover, the convergence of this method has not been 
proved, yet. 

In this paper, we will consider an annulus domain for simplicity to prove that 
the estimated boundary value obtained by the adjoint method converges to 
the exact one. Moreover, we will obtain the suitable step size such that the 
convergence becomes more rapid. We will confirm the effectiveness of some 
theoretical results through simple numerical experiments using the finite ele- 
ment method. 



2 Problem Setting 



For a two dimensional annulus domain f2 := {(x, y); Rid 2 < x 2 + y 2 < Rd 2 } 
with the outer boundary T d := {(x, y); x 2 + y 2 = Rd 2 } and the inner one 
Tid := {(x,y); x 2 + y 2 = -Rid 2 }, we consider the Cauchy problem of the Laplace 
equation: 

Problem 1 For given Cauchy data (u,q) € H^ 2 (T d ) x {dv/dn € H' 1 / 2 ^); 
-Av = in n, v\ Fd = u, v\r id = u, uo G # 1/2 (r id )}, find u e # 1/2 (r id ) such 
that 

-Au = in Q, 

du 

u = u, — = q on T d , 
k on 

where n denotes the unit outward normal to T d . 

In engineering and medical science, this is an important problem known as a 
mathematical model of the inverse problem of the electrocardiography, which 
is a problem to identify unknown electric potential on the surface of a human 
heart by observing the electric potential on the surface of a human body. 



2 




Fig. 1. Cauchy problem 
3 Variational Formulation 



To solve Problem 1, we consider the following variational problem based on 
the method of the least squares: 



inf J(u>), 
^effi/2 ( r id) 



Problem 2 Find lu* e # 1/2 (T id ) such that 

where the functional J is defined as 

J(u>) := f 



\V\U3) 



«| 2 dr, 



and v = v{uo) e H l (Q) depending on the given boundary value u £ H 1 ^ 2 ^^) 
is the solution of the following mixed boundary value problem called the pri- 
mary problem: 



(2) 



' -Av 


= 


in 


Q, 


dv 








dn 


= q 


on 


r d , 


V 

V 


= UJ 


on 


r id 



Our strategy is to find the minimum of J by generating a minimizing sequence 
{uJk}^ =0 C i/ 1 / 2 (r id ) by the steepest descent method: 



^k+i — u k — p k J'(uj k ), 



(3) 



starting with an initial guess c^o £ -f^ 2 (rid), with a suitably chosen step size 
{pfc}£L . The derivative J'{oj) is the first variation of J, defined by 



J{u + 5u) - J{u) = (J'(uj), 6u) + o(||<Jw||), 
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where (•, •) and || • || denote the inner product and the norm in L 2 (r id ), respec- 
tively: 

(/,</):= / fgdr, 11/11 :=(/,/) 1/2 . 



The first variation is explicitly given [1], [3] - [5] by 



(4) 



where v = v(v(u)) G H 2 (Q) depending on the solution v = v(u) of the primary 
problem is the solution of the following mixed boundary value problem called 
the adjoint problem: 



f -AO = 

dv 

dn 
v = 



in Q, 
2{v — u) on Td, 

on r id . 



(5) 



We remark that J'(u) G i/ 1/2 (r id ). 



4 Algorithm 



Our numerical method for the Cauchy problem reduces to an iterative process 
to minimize the functional (1) by the steepest descent method (3) after solv- 
ing the primary problem (2) and the adjoint problem (5) to obtain the first 
variation (4). 

For given oj = Uk, we denote the solutions of the primary and the adjoint 
problems by Vk := v(uk), Vk '■— v(v(cuk)), respectively. Then, our algorithm 
can be summarized as follows: 



Algorithm 

Step 0. Pick an initial guess luo G # 1/2 (r id ), and set k : 
Step 1. Solve the primary problem 



= 0. 



Av k 


= 


in 


Q, 


dv k 
dn 


= q 


on 


r d , 




= u k 


on 


r id 



(6) 



to find v k \ Td eiJ 1 /^,; 
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Step 2. Solve the adjoint problem 



{ -m = o 



in Vt, 



< 




2{v k - u) on T d , 



(7) 



v k = 



on r id 



to find the first variation 



J\u k ) 



ddk 
On 



E H 



1/2 (rid). 



Step 3. Choose the step size p k . 

Step 4. Update the boundary value by the steepest descent method 



Step 5. Set k :— k + 1, and go to Step 1. 

In numerical computations, the primary and the adjoint problems in Steps 
1 and 2 are numerically solved by the triangular finite element method. The 
conventional method to choose the suitable step size p k in Step 3 is the Armijo 
criterion, which guarantees for the sequence {pk}^^ to satisfy 



with a constant < £ < 1/2. 
Controlling the step size 

Step 3.0. Give < £ < 1/2, 0<r<l and the sufficiently small e > 0. 
Step 3.1. If ||J'(u; fc )|| < e, then stop; else go to Step 3.2. 
Step 3.2. Set (3 := 1, m := 0. 

Step 3.3. If J(u k - f3 m J'{u k )) < J(u k ) - £/3 m || J'(u k )\\ 2 , then set p k := (3 m ; 

else go to Step 3.4. 
Step 3.4. Set (3 m+1 := r(3 m . 
Step 3.5. Set m := m + 1, and go to Step 3.3. 

To choose the step size, we have to evaluate the functional value on the left 
hand side in (9) many times. It means that the primary problem has to be 
solved many times. This is an disadvantage of the conventional method. 



5 Convergence Proof and the Optimal Step Size 

In this section, we prove that our numerical method is convergent under some 
assumption. Moreover, we propose suitable step sizes to avoid the disadvan- 



^fc+l ^k PkJ'(^k)- 



(8) 



J(uj k - p k J'(uj k )) < J(uj k ) - £p k || J\u k ) || 2 



(9) 
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tage of the conventional method. The following argument is based on the 
convergence proof of the Dirichlet-Neumann alternating method [6], which is 
one of the domain decomposition methods [2]. 

We denote by v* and v* the solutions of the primary and the adjoint problems 
for the boundary value uo = uj*, respectively. Let the error functions e& : = 
v* — Vk, ek := v* — dk, and /ik '■= — uJk- Then, from (6) and (7), we see that 
the functions e k and e k are the solutions of 



(10) 



and 



' -Ae fc 


= 


in 


tt, 


de k 
dn 


= 


on 




e k 


= fa 


on 


Tid, 


' -Ae k 


= 


in 


n, 


de k 
dn 


= 2e fc 


on 


r d , 


e k 


= 


on 


r id 



;n) 



respectively. 



We assume that the error ji k can be expanded into the finite Fourier series: 



N 



\j\=M 



(12) 



with some integers M and N such that N > M > 0. From (10), we have 



N 



e k\r d 



\j\=M 



Substituting (13) into (11), we can derive 



dek 
dn 



r id 



\j\=M l-^id + n d ) 



aje 



ijO 



Here, noting that v* = 0, we have 



J M = -d^ 



r id 



We can see from (8) that 

Hk+l — Hk + PkJ'i^k)- 

Substituting (12) and (15) (namely (14)) into (16), we have 

af +1) = (1 - C^af 



(13) 



(14) 



(15) 



(16) 



(17) 
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with 

Uj - (R id 2ljl + R& m ) 2 ' 

If we take the step size as 

2 

< p k < — , 

then we have 



\Sf\ < <f« = max{|l - C MPk \ , |1 - CWp fc |} < 1 (18) 



with 5f ] := 1 - CjPk and <5 (fe) := jnaxj^. It follows that {\\p k \\}T =0 is a 
strictly monotone decreasing sequence: 



IK + i|| <s^\\p k \\ 

with the compression factor 8^ < 1. Therefore, we obtain that p k — > 0, that 
is, Wfc — > a;* as A; — > oo. 

We notice that the convergence cannot be guaranteed in general because 5^ = 
1 in (18) if pk is expanded into the infinite series. 

As a consequence, we can obtain the following theorem: 

Theorem 1 For the exact boundary value u>*, we assume that the error pu = 
uj* — Uk can be expanded into the finite Fourier series: 



p k = £ 4 )e%j6 (19) 

\j\=M 

with some integers M and N such that N > M > 0. If we take the step size 
as 

n J_ r SR^R^- 1 

Pk< C M ' Cj - (R id ^+R d ^r 
then {uJkj^o converges to uo*. 

Corollary 1 The optimal step size p op t in the sense that the compression 
factor is minimized is given by 

2 

Po P t = n n ■ (20) 

Then, the optimal compression factor 5 opt is given by 

C M - C N 

dopt = r r ■ (21) 
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Proof Taking 

1 1 — CuPk | = 1 1 — CjvPfc I = ^opt 

so as to minimize 5 {k) in (18), we obtain (20) and (21). □ 



The next theorem is very effective in actual computations. 

Theorem 2 The error p k is assumed to be expanded into (19). If we put p k = 
l/C M +k or p k = l/C N - k (k = 0, 1, . . . , N - M), then we have Pn-m+i = 0, 
namely the exact u* is obtained after (N — M + 1) iterations. 



Proof For k = 0,1, . . . , N — M, substituting p k = l/Cu+k (resp. p k = 
1/Cjv-fc) into (17), we have aj^+fc = (resp. = 0). Then, it follows that 

aS+fc = (resp. a$_ fc = 0) for all Z = k + 1, k + 2, . . . , N - M + 1. □ 



6 Numerical Experiments 



Let the radii Rd = 3 and i?id = 1. The domain f2 is divided into triangular 
finite elements with 8552 elements and 4436 nodes. We take uq = as an 
initial guess. The stop condition of our calculations is given by J{uJk) < 10~ 5 . 
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k 



Fig. 2. Functional values 

As the first example, let the Cauchy data be (u, q) = (9 cos 29, 6 cos 29). Then, 
the exact potential in fl and the related exact boundary value on Tid can be 
written as follows: 

u* = r 2 cos 20, 

uo* = u*\r id = COS26 1 . 

We can see from the Cauchy data that M — N — 2. It follows that p opt = 
1/C 2 = 1681/486(^ 3.46) and 5 opt = 0. Hence, theoretically it should hold 
that lo\ = uj*. But, in actual computations, we remark that uj\ ^ uj* due to 
discretization errors by the finite element method. For k > 1, we regard that 
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Fig. 3. The estimated boundary value 

M = and N is sufficiently large. Then, according to Corollary 1, we have 
p opt ~ 2/C = Rid/Rd = 1/3. Hence, we take 



Pk = < 



1681/486(^ 3.46) (k = 0) 
l/3(« 0.33) (A; > f) 



Then, the graph (a) in Figure 2 shows the variations of functional value. Since 
the functional value decreases steeply by taking the step sizes as large as 
possible, the convergence becomes rapid. On the other hand, the graph (b) is 
the result in the case when we apply the Armijo criterion with £ = 1/3 and 
r = 1/2. When the Armijo criterion is applied, only to choose the step sizes, 
the primary problem has to be solved 89 times. Hence, until the estimated 
boundary value converges, in all we have to solve the direct problems 58 (= 
29 x 2) times in the case of the graph (a) and 163(= 37 x 2 + 89) times in the 
case of the graph (b), respectively. We can see that (22) makes convergence 
more rapid than the Armijo criterion. On the other hand, Figure 3 shows the 
estimated boundary value Uk and the exact one u*. We can see that io\ is in 
good agreement with u*. It is concluded that the estimated boundary value 
quickly converges to the exact one. 

As the second example, we assume that the Cauchy data is 

/ 3 9 l 3 \ 

(u, q) = I 6 sin 9 cos 9 + - cos 29, 2 sin 9 cos 9 + - cos 29 J . 

V. 2 4 2 2 / 

Then, the exact potential in fl and the related exact boundary value on Tid 
can be written as follows: 

1 i 

u = r (2 sin 9 — — cos 9) + -r 2 cos 29, 

= u\r., = 2 sin 9 — - cos 9 + - cos 29. 

I J- id o A 
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Fig. 4. Functional values 



We can see from the Cauchy data that M — 1 and N = 2. But, due to 
numerical errors, we assume that M — 0. According to Theorem 2, we take 
the step size as 



Pk 



l/C 2 - fe (A; = 0,1, 2) 
l/3(« 0.33) (A; > 3) 



(23) 



Then, the graph (c) in Figure 4 shows the variations of functional value. As 
we can see, the functional value reaches less than 10~ 5 at k — 5. On the other 
hand, using the Armijo criterion with £ = 1/3 and r = 1/2, we obtain the 
functional value less than 10" 5 at k = 31, which is shown as the graph (a). 
When we apply the Armijo criterion, only to choose the step sizes, the primary 
problem has to be solved 76 times. Hence, until the estimated boundary value 
converges, in all we have to solve the direct problems 10(= 5x2) times in the 
case of the graph (c) and 138(= 31 x 2 + 76) times in the case of the graph 
(a), respectively. If we do not know the concrete values of M and N, it is 
reasonable to assume that M = and N is sufficiently large. Then, the step 
size is taken as p& = l/3(~ 0.33) for all k > according to Corollary 1. The 
variations of functional value for this step size are shown by the graph (b). 
Although the number of iterations for the graph (b) is greater than that for 
the graph (a), the computational cost for (b) is less than that for (a) in all. 



Figure 5 shows the estimated boundary value uj k and the exact one on*. We 
can see that uj 2 is roughly the same as uj* . 



Therefore, through two examples, it is concluded that the computational cost 
can reduce and the estimated boundary value quickly converges to the ex- 
act one by applying Corollary 1 and Theorem 2, which are very effective in 
numerical computations. 
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Fig. 5. The estimated boundary value ujk 
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